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Assistant Commissioner for Patents 
Box: New Patent Application 
Washington, DC 20231 

fl Re: Patent Application 

^ Our File No, 0496747-0019 

S Transmittal Letter 

111 Dear Sir: 

L Please find enclosed a patent application as follows: 

11 Applicant(s) Christopher J. Stakutis Kevin M. Steams 

\q Title: METHODS AND APPARATUS FOR HIGH-SPEED ACCESS TO 

a AND SHARING OF STORAGE DEVICES ON A NETWORKED 

DIGITAL DATA PROCESSING SYSTEM 

1) This Transmittal Letter. 

2) Specification (24 pp., mcl. cover sheet), Clauns (13 pp.), Abstract (1 pg.) and 
Drawings (4 sheets); 

3) Declaration and Power of Attorney (unexecuted); 

4) A Verified Statement (Declaration) Claiming Small Entity Status (unexecuted); 

5) Assignment (unexecuted); and 

6) Two return postcards. 

Please do not record the enclosed unexecuted Assignment, nor charge a fee for such recordal. 
If this application is found otherwise to be INCOMPLETE, or if at any time it appears that a 
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TELEPHONE CONFERENCE with counsel would helpfully advance prosecution, please telephone the 
undersigned at any time. 

Kindly acknowledge receipt of the foregoing application by returning the self-addressed 
postcards. 



ATTORNEY'S OF APPLICANT 
y Choate, Hall & Stewart 
^ Exchange Place 
a 53 State Street 
5 Boston, MA 02109-2891 
i Telephone: (617) 248-5000 
LI! Telefax: (617) 248-4000 



Respectfully submitted. 
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Applicants: Christopher J. Stakutis Attorney's Docket No.: 0496747-0019 

Kevin M. Steams 

Serial No.: 09/002,266 

For: METHODS AND APPARATUS FOR HIGH-SPEED ACCESS TO AND 

SHARING OF STORAGE DEVICES ON A NETWORKED DIGITAL DATA 
PROCESSING SYSTEM 

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY 
STATUS (37 CFR 1.9 (F) AND 1.27 (D)) - SMALL BUSINESS CONCERN 

I hereby declare that I am: 

( ) the owner of the small business concern identified below: 

( X ) an official of the small business concern empowered to act on behalf of the 

concern identified below: 

NAME OF CONCERN Mercury Computer Systems. Inc. 

ADDRESS OF CONCERN 199 Rivemeck Road Chelmsford, MA 01824 



I hereby declare that the above-identified small business concern qualifies as a small business 
concern as defined in 13 CFR 121.3-18, and reproduced in 37 CFR 1.9(d), for purposes of 
paying reduced fees under section 41(a) and (b) of Title 35, United States Code, in that the 
number of employees of the concern, including those of its affiliates, does not exceed 500 
persons. For purposes of this statement, (1) the number of employees of the business concern 
is the average over the previous fiscal year of the concern of the persons employed on a full- 
time, part-time or temporary basis during each of the pay periods of the fiscal year, and (2) 
concerns are affihates of each other when either, directly or indirectly, one concern controls 
or has the power to control the other, or a third party or parties controls or has the power to 
control both. 

I hereby declare that rights under contract or law have been conveyed to and remain with the 
small business concern identified above with regard to the invention, entitled METHODS 
AND APPARATUS FOR HIGH-SPEED ACCESS TO AND SHARING OF STORAGE 
DEVICES ON A NETWORKED DIGITAL DATA PROCESSING SYSTEM by the above 
named applicants described in 

( x) the specification filed herewith 

( ) application serial no. 

( ) patent no. , issued 
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If the rights of the above-identified small business concern are not exclusive, each individual, 
concern or organization having rights to the invention is listed below* and no rights to the 
invention are held by any person, other than the inventor(s), who could not qualify as a small 
business concern under 37 CFR 1.9(d) or by any concern which would not quahfy as a small 
business concern under 37 CFR 1.9(d) or a nonprofit organization under 37 CFR 1.9(e), 
*NOTE: Separate verified statements are required from each named person, concern or 
organization having rights to the invention averring to their status as small entities, (37 CFR 
1.27) 

FULL NAME 

ADDRESS 

( ) Individual ( ) Small Business Concern ( ) Nonprofit Organization 

FULL NAME 

ADDRESS 

( ) Individual ( ) Small Business Concern ( ) Nonprofit Organization 

I acknowledge the duty to file, in this application or patent, notification of any change in 
status resulting in loss of entitlement to small entity status prior to paying, or at the time of 
paying, the earliest of the issue fee or any maintenance fee due after the date on which status 
as a small entity is no longer appropriate. (37 CFR 1.28(b)) 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are beheved to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of the United 
States Code, and that such willful false statements may jeopardize the validity of the 
appHcation, any patent issuing thereon, or any patent to which this verified statement is 
directed. 

NAME OF PERSON SIGNING 

TITLE OF PERSON 

ADDRESS OF PERSON 



SIGNATURE DATE 
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Background of the Invention 

This application is a continuation of United States Patent Application Serial No. 
09/002,266 filed on December 31, 1997, the teachings of which are incorporated herein by 
reference. 

The invention pertains to digital data processing and, more particularly, to high-speed 
access to and sharing of disk drives and other storage devices on a networked digital data 
processing system. The invention has application, for example, in accessing and sharing 
video, graphics and other large data files on a networked computer system. 

In early computer systems, long-term data storage was typically accomplished via 
dedicated storage devices, such as tape and disk drives, connected to a data central computer. 
Requests to read and write data generated by applications programs were processed by 
special-purpose input/output routines resident in the computer operating system. With the 
advent of "time sharing" and other early multiprocessing techniques, multiple users could 
simultaneously store and access data - albeit only through the central storage devices. 

With the rise of the personal computer (and workstation) in the 1980's, demand by 
business users led to development of interconnection mechanisms that permitted otherwise 
independent computers to access on one another's storage devices. Though computer 
"networks" had been known prior to this, they typically permitted only communications, not 
storage sharing. 

Increases in the power of the personal computer is opening ever more avenues for their 
use. Video editing applications, for example, have until recently demanded specialized video 
production systems. Now, however, such applications can be run on high-end personal 
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computers. By coupling these into a network, multiple users can share and edit a single video 
work. 

Alas, network infrastructures have not kept pace with the computers which they 
connect. Though small data files can be transferred and shared quite effectively over 
conventional network interconnects, such as ethemet, these do not lend themselves to sharing 
and transferring large files. Thus, although users are accustomed to seemingly instantaneous 
file access over a network, it can take over an hour to transfer a 60 sec. video file that is 1.2 
GBytes in length. 

The prior art has developed interconnects that permit high-speed transfers to storage 
devices. The so-called fiber chaimel, for example, affords transfers at rates of up to 100 
MBytes/sec ~ more than two orders of magnitude faster than conventional network 
interconnects. Although a single storage device may sport multiple fiber channel interfaces, 
no system has been developed to permit those workstations to share files on that storage 
device. 

In view of the foregoing, an object of the invention is to provide improved digital data 
processing systems and, particularly, improved methods and apparatus of high-speed access to, 
and sharing of, disk drives and other storage devices on a networked computer system. 

A related aspect of the invention is to provide such systems as can be implemented 
with minimimi cost and maximum reliabihty. 

Yet another object of the invention is to provide such systems as can be readily 
adapted to pre-existing data processing systems. 
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Yet still another object of the invention is to provide such systems as can be readily 
integrated with conventional operating system software and, particularly, conventional file 
systems and other input/output subsystems. 
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Summary of the Invention 

The foregoing objects are among those attained by the invention, which provides novel 
methods and apparatus for sharing peripheral devices on a networked digital data processing 
system. 

In one aspect, the invention provides a digital data processing system with improved 
access to information stored on a peripheral device. The system has a plurality of digital data 
processing nodes and a peripheral device. A first node (e.g., a "chent" node) is connected to 
a second node (e.g., a "server" node) over a first communications pathway (e.g., a network). 
The second node is itself connected to the peripheral device (e.g., a disk drive) over a second 
communications pathway. The first node, too, is connected to the peripheral device, over a 
third commimications pathway. 

By way of non-limiting example, the first and second nodes can be a client and server 
networked to one another by Ethernet or other communications media, e.g., in a wide area 
network, local area network, the Internet interconnect, or other network arrangement. The 
server and client can be connected to the peripheral device, e.g., a disk drive, mass storage 
device or other mapped device, via a SCSI channel or other conventional peripheral device 
channel. Preferably, however, they are connected to the peripheral device via a fibre channel, 
"firewire" (i.e., IEEE 1394 bus), serial storage architecture (SSA) bus, high-speed Ethernet 
bus, high performance parallel interface (HPPI) bus or other high-speed peripheral device bus. 

A file system, executing on the first and second nodes, is capable of responding to 
access requests generated by the first node for transferring data between that node and the 
peripheral device, via the second node and via the first and second communications pathways. 
The file system also maintains administrative information pertaining to storage on the 
peripheral device of data designated by such requests. That information includes, for 
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example, physical storage location mappings (or "file maps") for files and other data stored on 
the peripheral device. By way of example, the file system can represent the combined 
functionality of conventional local and network file systems, e.g., on Windows NT or UNIX 
client and server file systems. 

A bypass mechanism, which executes on at least the first node, intercedes in the 
response to at least selected input/output, or access, requests generated by that node. The 
bypass transfers data designated by such requests between the first node and the peripheral 
device over the third communications pathway, in Ueu of transferring that data via the second 
node and the first and second communications pathways. Such transfers by the bypass are 
made using the administrative information maintained by the file system relating to storage of 
such data on the peripheral device. 

By way of example, the bypass can intercede in response to requests by the 
applications programs executing on the first node to read or write data on the peripheral 
device. Rather than permitting the file system to transfer that data via the first and second 
communications pathways, the bypass transfers it directly over the third communications 
pathway to the peripheral device. File mappings governing the physical locations at which 
the data is stored on the peripheral device are obtained from the second node. 

Further aspects of the invention provide digital data processing systems as described 
above in which the bypass obtains such file mappings or other administrative information by 
applying further access requests to the file system. In one such aspect of the invention, the 
bypass issues two such requests. A first request causes the information to be retrieved into an 
actual or virtual ("ghost") file local to the second node. A second request by the first node 
causes that information to be transferred over the network back to the first node. 
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In a related aspect of the invention, the bypass issues an access request, e.g., a file 
write, to a logical unit to which access is controlled by the second node. Data contained in 
that request identifies the file to which the original access request was directed and for which 
mappings are required. The logical unit specified in the further request can be, for example, a 
file on the peripheral device (e.g., other than file to which the original access request was 
directed) or, preferably, a "ghost" file. A second bypass, executing on the second node and 
coupled to the file system resident there, intercedes in response to that request by obtaining 
the file mappings from the second node. This is accomplished, for example, through issuance 
of a request to the local or network file system resident on the second node. The second 
bypass stores that information in the logical unit designated by the first request. 

In further accord with this aspect of the invention, the first bypass issues a still further 
access request, e.g., file read, to the same logical unit. The server bypass can intercede in the 
file system's response to that request, e.g., where the logical unit is a ghost file, by causing 
the file system to pass back file mappings previously stored to the resident data structures. 

In a related aspect of the invention, the client bypasses selectively limits transfers 
between their respective nodes and the peripheral device and, thereby, prevents the nodes 
from "hogging" that resource. Limiting can be accomphshed, for example, using throttling 
limit or other numerical value specifying, e.g., a maximum quantity of data transfer by the 
respective node per unit time. 

Still further aspects of the invention provide a scaleable networked digital data 
processing system comprising first and second nodes configured as described in which the 
first and second nodes are server nodes, each of which is coupled to one or more client nodes. 
Related aspects of the invention provide such a scaleable networked system comprising a third 
server node, itself coupled to one or more client nodes, as well as to the second node over a 
fourth communications pathway. As with the first node, the third node, too, includes a bypass 



Express Mail Label No. EE776165281US 

that responds to requests generated by that node for transferring data designated thereby 
between the third node and the peripheral device over an additional communications pathway. 

Still further aspects of the invention provide methods of operating digital data 
processing systems paralleling the operations described above. 
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Brief Description of the Illustrated Embodiment 

A more complete understanding of the invention may be attained by reference to the 
drawings, in which 

Figure 1 depicts a scaleable networked digital data processing system configured in 
accord with the invention; 

Figure 2 depicts the software architecture of two nodes sharing a peripheral device in 
accord with the invention; and 

Figures 3 and 4 show a messaging sequence illustrating a method of operating a digital 
data processing system in accord with the invention. 
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Detailed Description of the Illustrated Embodiment 

Figure 1 depicts a scaleable networked digital data processing system configured in 
accord with the invention. The system 10 includes a plurality of nodes 12-24, including 
5 two server nodes 18, 20 coupled via network pathways 26, 28 to chent nodes 12 - 16 and 22 
- 24, as shown. Server nodes 18, 20 are additionally coupled to one another via network 
pathway 27. 

In the illustrated embodiment, nodes 12 - 24 represent digital data processing 
10 apparatus or other devices capable of being coupled to one another in a network and, more 

particularly, by way of example, in a chent-server configuration. Illustrated server nodes 18, 

20 represent mainframe computers, workstations, personal computers, or other digital data 
R processing apparatus capable of providing server functions in such networks and, particularly, 

of controlling access to shared peripheral devices, such as storage device 36. Nodes 12-16 
ji and 22 - 24 likewise represent workstations, personal computers, dedicated devices, or other 

digital data processing apparatus that generate requests for access to such shared peripheral 

devices. 

The network pathways 26 - 28 represent wire cable interconnects, wireless 
M interconnects, point-to-point interconnects, Internet interconnects or other digital 

communications interconnects of the type known in the art. Those pathways can be 
configured in any configuration that permits a node 12 - 16, 20 - 24 requesting access to a 
shared peripheral device 36 to communicate that request to a node 18 controlling access 
thereto. For purposes hereof and unless otherwise evident from context, such a requesting 
25 node is referred to as a "chent," regardless of its role (i.e., as a chent or server) in the 
conventional network defined by nodes 12-18 and pathway 26, or nodes 20 - 24 and 
pathway 28. Thus, for example, node 18 could be a "chent" to node 16 for purposes of 
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sharing peripheral device 34, presuming an auxihary connection (e.g., fibre channel) were 
provided between node 18 and that peripheral device. 

In the illustrated embodiment, nodes 12-24 operate under the Microsoft Windows 
NT operating system, though those skilled in the art will appreciate that the nodes 12-24 
may utilize other client and server operating systems, as well. Moreover, it will be appreciate 
that nodes need not utihze the same operating systems. Thus, for example, server 18 may 
operate as a Windows NT-based server, while server 20 operates as a UNIX-based server. 
The invention is therefore seen to have the advantage of permitting multiple nodes of different 
pedigrees, or operating system types, to share a common peripheral device. 

With further reference to Figure 1, the nodes 12-24 are coupled to respective 
dedicated storage devices 30 - 42, as shown. Such couplings are provided by SCSI channels 
or other device interconnects suitable for permitting the nodes to transfer information with 
such devices. In addition to being coupled to their own dedicated storage devices 34, 38, 
nodes 16, 20 are coupled to the storage device 36 that is controlled by node 18. In the 
parlance of the invention, nodes 16, 20 are referred to as "clients" and node 18 is referred to 
as a "server." Coupling between the clients 16, 20 and the shared peripheral device 36 can be 
provided by any conventional peripheral device interconnect, though, preferably, it is provided 
by high-speed interconnects such as fibre channel, "firewire" (i.e., IEEE 1394 bus), serial 
storage architecture (SSA) bus, high-speed Ethemet bus, high performance parallel interface 
(HPPI) bus or other high-speed peripheral device bus. 

Figure 2 depicts farther detail of the hardware and software architecture permitting the 
sharing of peripheral device 36 by nodes 16, 18 in a system according to the invention. 
Though the discussion that follows is directed to sharing among these devices, those skilled in 
the art will appreciate that the teachings can be apphed equally, e.g., to the sharing of device 
38, by nodes 18, 20, the sharing of device 34 by nodes 16, 18, and so forth. Moreover, those 
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teachings can be applied to create a scaleable network. To this end, additional server nodes, 
such as node 20, can be coupled to a common peripheral device 36, as well as to the node 18 
which controls that device, to give still further nodes 22 - 24 to that device 36. Preferred 
uses of the invention are to permit two (or more) network chent nodes, e.g., 14, 16, to share a 
common peripheral device, or to permit two (or more) network servers, e.g., 18, 20, to share 
such a device. 

Referring to the drawing, nodes 16, 18 are coupled to one another via communications 
pathway 26 and to peripheral device 36 via pathways 44, 46, respectively. As noted above, 
pathway 44 (coupling device 18 to peripheral 36) can be a SCSI channel or other 
conventional peripheral device interconnect. Likewise, as noted above, pathway 46 (coupling 
device 16 to peripheral 36) can be a conventional peripheral device interconnect, though, 
preferably, is a high-speed interconnect such as fibre channel, "firewire" (i.e., IEEE 1394 
bus), serial storage architecture (SSA) bus, high-speed Ethernet bus, high performance parallel 
interface (HPPI) bus or other high-speed peripheral device bus. 

Executing on node 16 are one or more appUcations programs 48 (e.g., including video 
editing programs, image analysis programs, and so forth) that generate requests for access to 
local and networked peripheral devices, including shared device 36. Those applications 
programs execute in the conventional manner under the control of an operating system, e.g., 
Windows NT, which includes a file system that services those access requests. 

In the illustration, that file system is represented by elements 50 - 54, including 
"upper" file system 50, representing the Windows NT I/O Subsystem Manager and other 
components responsible for interfacing with applications programs 48 and for routing 
peripheral device access requests to the file system; "lower" file system 52, representing the 
Windows NT File system drivers and intermediate drivers and other components responsible 
for local, disk-based file systems, SCSI drivers and the like providing generic functionality to 
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a common set of devices; and drivers 54, representing software (and hardware) components 
for transferring information to and jfrom attached peripheral devices 34, 36. 

Because node 16 is a cUent vis-a-vis the Windows NT network, the drivers also 
include a network redirector, such as the Windows NT LANManRedirector, that transfers 
access requests to and from the shared peripheral device 36 via server node 18 and pathways 
26 and 44. The node 18, which includes network server component 56, handles such requests 
in the conventional manner of a server of a networked digital data processing system. As 
illustrated, node 18 also includes a file system, comprising elements 58 - 64, whose 
operations parallel those of components 50 - 54 on the node 16. 

Though the illustrated architecture for node 16 is in accord with that dictated by 
Windows NT, those skilled in the art will appreciate that the invention may be embodied in 
devices running under other operating systems, as well. 

General computer networking provides a great many benefits of which a primary is 
having a common/central pool of storage that can be shared in a controlled manner between 
multiple nodes. That sharing takes place over a network communications pathway, such as a 
local area network (LAN), which is usually fast enough for only basic uses. Some uses, such 
as video or graphics or large numbers of connected users, however, can saturate a 
conventional network communications pathway. 

Systems constructed and operated according to the invention allow multiple nodes to 
share a peripheral device without intervening communications pathways or nodes 
bottlenecking selected data transfers. A server node is still used in this "fused drive" 
approach to store physical file mappings and other administrative information - and to 
otherwise administer - the shared device. A direct connection, or "directly attached disk 
connect," is also provided however between each node and the shared device to permit certain 
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operations, e.g., bulk reads and writes, to be accomplished directly between the nodes and the 
peripheral device. The illustrated embodiments provide this capability through 
communications pathways such as pathway 46 and through filter drivers 66, 68 incorporated 
into the file systems. Among the many advantages of these systems is that they provide orders 
of magnitude performance improvements at virtually no additional hardware costs. 

In the discussion that follows, systems according to the invention are variously referred 
to as "fused drive systems", "fused drive technology", "fused drive", and the like. Unless 
otherwise evident from context, the term "file system" refers to the combined operation of the 
nodes' native file systems (e.g., comprising elements 50 - 54 and 56 - 64) and of the network 
server, e.g., 56., and file system, e.g., 56 - 64, of the node to which the shared peripheral 
device is assigned and of the file system, e.g., 50 - 54 of the node that shares that device. 

Networking 

To facilitate implementation of the invention, it is preferable that the underlying digital 
data processing system have at least basic networking capabilities that allows for the concept 
of "file server". Windows NT provides such an architecture. Programs can access files either 
locally or remotely through a variety of Windows general file specifications (e.g. "C:t.txt" 
where "C" is a local drive, or "H:t.txt' where "H" is mapped network drive, or 
"\\server\share\t.txt", where 'server' is another computer). This abstraction is successful 
because the apphcations software 48 need not know about the specifics of where a file resides 
in order to be able to access it through a well defined and globally supported set of 
commands. 

File map 

In order to correctly intercept file reads and writes (to provide for acceleration), it is 
preferable to know exactly how each file is physically laid out on the shared peripheral 
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device, e.g., disk 36. Application programs 48 make requests based on logical file blocks. 
The file system presents to an application 48 an abstraction of a file that appears to be a 
series of contiguous data blocks. In reality, the file system allocates physical pieces of the 
disk drives separately and knits them together in a variety of file system specific trees and 
directories and maps (and other structures). While a logical-block x might reside on physical 
block y, block x+1 might live an entirely different area. 

As with other operating systems, Windows NT provides functions for interrogating the 
physical allocation map of a file. These are provided primarily for the handful of programs 
that actually need to understand such things (e.g., disk defragmenters). Without this map, it 
might be possible to build a map of file blocks by understanding the NTFS data structures and 
traversing many native file structures. 

Volume Serial Number 

Systems according to the invention provide two paths to the shared storage, e.g., 
device 36: one path is via the directly attached media wire, e.g., 46, and the other path is 
through conventional networking, e.g., 26, to the other node, e.g., 18, that has direct access. 
"Fusing" a drive in accord with the invention requires intercepting important I/O commands 
an issuing them locally. But to which local device? 

Each disk volume, e.g., 36, has a "volume serial number" that NT places on the 
volume. When an appUcations program 48 or user makes a request to "fuse" a volume, the 
illustrated system locates the correct locally attached volume by examining the volume serial 
number. 

File System Filters 

Most modem operating systems are put together in well defined layers and those layers 
are allowed to have well defined pieces (developed by "third parties") plug into them to 

14 



Express Mail Label No. EE776165281US 

extend functionality. The most common example is a add-on board, e.g. a video board, and 
needs some special software to handle the board details yet presents a standard interface to the 
rest of the operating system so that software written to the generic abilities will work with the 
new device. 

Windows NT at a very fundamental layer abstracts all devices; that is, any device (be 
it a physical adapter or a software device) winds up having an entity in the NT system known 
as a "device". A device supports a well defined interface and all implementations of a device 
must provide appropriate code for the planned interface points. 

What is significant is that NT supports "layering" a device. This means providing an 
ability to intercept any and all transactions to a device and possibly introduce additional 
processing. A layered device driver can do any of the following per function: handle the 
function entirely itself and dismiss it; do some pre-processing and then let tiie natural fimction 
take over; do post processing (after the natural fimction). 

The illustrated embodiment exploits this structuring opportunity. That is, in general the 
goal is to let as much of regular processing happen so that all the natural benefits are realized 
(security, sharing, etc), and only accelerate certain important features. 

The NT function lOAttachDevice is used to perform the layering. 

NT Device types 

There are many different extensions of the base "driver" class in NT. The following 
are the ones that are interesting for solving this problem: 
FileSystemDevice 
FileSystemRecognizer 
LANMANRedirector 
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NT typically supports two main file system types: NTFS and FAT. Each "volume" in 
the NT system (e.g. "C:") is an instance of a FileSystemDevice. NT provides a generic file 
sub-system interface for applications (e.g. open(), read(), write()). NT will then call the 
appropriate file system- specific routines based on the target. There is a "class" (a driver) for 
each type of file system present (e.g. NTFS class, FAT class, etc), and a separate "instance" 
of that class for each and every volume (e.g. C:, D:, E:). 

There is a pseudo device called the FileSystemRecognizer. Whenever NT mounts a file 
system (typically at boot time), NT determines which class needs to manage it, and then 
provides that class an opportunity to deal with it. NT provides a mechanism to inform other 
system elements of this mounting (or unmounting). That mechanism is through the pseudo 
device FileSystemRecognizer. One of the particular entry points for the FileSystemRecognizer 
provides details regarding volumes that are being mounted or dismounted. Thus, a layered 
driver on top of this device could then be made aware of such events taking place. 

It is preferable for the illustrated embodiment to intercept and thus know about all 
system mount/unmount activity. Information such as the volume label and physical device 
elements is important to note for subsequent tasks. 

On the "client" side, e.g., on the node 16 that accesses the shared peripheral device but 
that does not inherently control it, the driver 66 intercepts volume mounts and dismounts. It 
may disallow some volumes from mounting, e.g., perhaps due to security settings, or other 
issues such as multi-writers. If allowed to mount, the driver 66 will capture the volume label 
(and signature which is volume unique) and other pertinent information. This data is stored in 
a globally available area the server driver 68 can use when necessary. During a dismount, 
this information is discarded. 
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For whatever reason, network mapped devices (which appear as a mounted file system 
to the casual NT user), do not go through this mount/dismount notification process. The 
illustrate embodiment needs to be aware of network mounted file systems so that it can have 
an opportunity to accelerate some of their fimctions. For this purpose, it layers on top of the 
separate driver class called LANMANRedirector that essentially manages NT's LAN Manger 
mounted volumes (which is the typical network file system mounter that comes with NT). 

Overall Flow Summary 

In general networking there is typically a concept of a "server" machine and many 
clients that are attached to that server. The server ''serves" tasks to the clients and one of the 
most typical tasks is serving file data. Mapped file systems allow a client to pretend that a 
disk that is physically mounted on a "server" machine to seem mounted on the client as well. 
The network and operating systems will cooperate to manage that illusion and transparently 
feed data back and forth between the machines. 

Though the invention can be implemented to achieve a client-server relationship 
between the devices, e.g., 16, 18, that share a peripheral device, e.g., 36, it does not 
necessitate such a relationship. That is, there is no requirement for a single machine to be the 
data master of all transactions. However, there is a concept of a master writer, e.g., node 18, 
and for simplicity sake this document will refer to that machine as the "server" for a 
particular volume. 

In the illustrated embodiment, all participating machines, e.g., 16, 18, have a direct 
connection to the shared peripheral device, e.g., 36, and thus they all have the opportunity to 
mount the volumes as local volumes (and conceivably at the same time). Because of cache 
coherency issues, it is necessary to insist that only a single machine be a real "writer" (server) 
of a volume at a time and that proper cache flushing take place between ownership changes. 
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The illustrated embodiments route some requests through the regular networking 
technology and others are by-passed and go directly to the attached storage interface. Thus, 
accessing the shared device, e.g., 36, will in fact result in some amount of network activity 
and thus some amount of impact on another machine (that is "serving" the interface to that 
storage volume). 

The term "server", as used herein and unless otherwise evident from context, means 
"the machine that is serving the needs regarding a particular volume", but does not mean "the 
machine that serves all the needs of a set of volumes" necessarily. A peer-to-peer arrangement 
is both possible and beneficial wherein each participant is a primary writer (server) for some 
volumes, and a chent for the others. The reality is that the number of bytes transferred over 
the conventional network is insignificant in comparison to the number "accelerated" to the 
direct attached interface. 

Preliminary Configuration Issues 

In order to implement the invention, it is required that all participating nodes, e.g., 16, 
18, be coupled to the shared device, e.g., 36, preferably by a direct attached interface, such as 
Fibre Chaimel, or the like, and that they can "mount" that device locally. It is also required 
that all participants, e.g., 16, 18, be in communication with each other over regular 
networking, e.g., 26, and that the regular network protocols be present that allow for 
mounting remote file systems. 

The illustrated embodiment receives a request to "fuse" a particular volume, e.g., 
device 36, from the user or from an applications program 48. In order to proceed with this, it 
must know which node is to be the real "server" of that volume. 
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Booting 

When each node, e.g., 16, 18, is first booted, the local file system filter driver, e.g., 
66, 68, is loaded and "layered" on top of the FileSystemRecognizer and is thus apprised about 
the mounting/uiunounting of volumes. Each time a volume is mounted, it will record 
information about that volume for possible later use (e.g., in a global data structure) and also 
install the filter for that particular FileSystemDevice. The driver, e.g., 66, also layers on top 
of the LANMANRedirector and is thus apprised of all activities related to mapped drives. 
The mounting of a volume that will be "fused" in accord with the invention is shown in 
Figure 3, step (1). 

When a "make fixsed" request comes in (e.g., from a configuration program), the local 
filter driver, e.g., 66, takes the following actions: 

1) It issues a request to the resident portion of the file system to create a mapped 
device for it (e.g. M:); 

2) When the file system completes the mapping, it performs "mount" and the local 
driver, e.g., 66 receives control (due to the layering on the 
LANMANRedirector); 

3) The local filter driver then checks the specifics of this network volume being 
mounted (specifically the volume signature) and compares that to all local 
volumes that it has access to. If it finds a match, then it knows it has an 
alternative (and higher performance path) to that same storage and records this 
information in global structures for subsequent transactions to exploit. 

The aforementioned actions are represented in Figure 3, as step (3). 
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Network mapping 

The natural course is for there soon to be some accesses of this newly created network 
device, e.g., peripheral device 26. All accesses of all network devices go through the filter 
driver, e.g., 66, that is layered on the LANMANRedirector. 

"CreateFileQ" is the Windows function for opening files (among other things) and is a 
required first step before files can be read or written. The opening of a file on the fused 
drive is indicated by step (4) in Figure 3. 

When CreateFile is called on client node 16, the client filter driver 66 interrogates the 
specifics of the file name and determines whether the file being accessed lives on a volume 
that is "fused". If so, the filter driver 66 needs to procure the information about how the file 
is physically laid out on the device so that subsequent read/write calls can go directly to the 
shared peripheral 36. 

Referring to step (5) of Figure 3, it gathers this information by sending a request to 
the real "writer" (server), e.g., 18, of that volume. Specifically, the filter driver 66 writes a 
request into a specific file (the "ghost" file) on an actual or virtual device controlled by the 
server, e.g., 18, That write request includes the name of the file referenced in the intercepted 
CreateFileO call. In a preferred embodiment, the aforementioned ghost file is so named 
because it doesn't really exist; rather, accesses to and from it on the server side are 
intercepted by the filter driver 58. 

Server side 

The filter driver, e.g., 68, resident on the server, e.g., 18, detects the request and 
determines the details of the file layout. As shown in step (6) of Figure 3, it first obtains the 
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name of the file by reading the ghost file. In step (7), it then calls a Windows NT function 
that is provided for defragmenter packages and other disk-optimizing utilities to obtain the file 
layout. To insure coherency, whenever a file map is requested, the server driver 68 also issues 
a local "purge" call to remove any material regarding that file in its own local caches. 

In step (8), the server filter driver 68 then writes that information back to the ghost 
file. This, in turn, is read by the client filter driver 66. See step (9). When the map comes 
back, the chent filter driver 66 stores in a global data structure to be exploited by subsequent 
read and write calls. 

Reads and Writes 

In step (10) of Figure 4, an appUcations program 48 on the chent node 16 attempts to 
read a file on the shared device 36. Upon intercepting that request, the client filter driver 66 
interrogates the global data structures to determine if the request is directed to a file for which 
mapping information is present. If so, previously stored the map is interrogated to determine 
the physical blocks needed from the device 36 to fulfill the request. The blocks are then read 
via the direct interconnection pathway, e.g., 46, and the request is dismissed (and no network 
transactions take place). See step (11). 

As shown in steps (12) - (13), a similar sequence is effected when an applications 
program 48 on the client node 16 attempts to write a file on the shared device, e.g., 36. 
However, any time such a file's allocation is extended, the allocation request is permitted to 
take the normal network path (e.g., via network communications pathway 26) for servicing by 
the network server 56 and file system resident on the server node 18. Once that is completed, 
another mapping is obtained as described above. In one embodiment of the invention, the 
client filter driver 66 forces any such allocations to be much larger so that they happen less 
frequently. File writes then can take place in the same way that file reads do (using the map 
to determine the actual blocks to locally write). 
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In a preferred embodiment of the invention, the client filter driver 66 can limit the 
respective node's access to the shared device 36 via the auxihary pathway 44. This prevents 
the node 16 from "hogging" the device 36 to the exclusion of the other nodes. Limiting can 
be accomphshed by tracking the number of bytes transferred, or the number of accesses made, 
by node 16 over pathway 44 per imit time. If that number exceeds a predetermined numerical 
"throttling" limit, the driver 66 can delay responding to a newly intercepted request or route it 
for handling by the server 18 as a conventional network request. 

Normal activities 

Apart from file reads and writes, the filter drivers 66, 68 permit all other file activities 
to go through the normal networking interfaces. These include security checks, time and date 
stamps, directory look ups, etc. Those skilled in the art will appreciate that reads and writes 
account for the grand majority of the total number of bytes that would have passed through 
the network and, hence, by handling separately in the manner discussed above, the invention 
achieves great speed increases. 

Specifics of startup 

The filter drivers 66, 68 are essentially device drivers and are started in the normal 
operating system device context (system boot time). This is the desired time for the software 
to "hook" (layer) into the other devices that it needs to have control over. 
The drivers 66, 68 can correctly layer into the FileSystemRecognizer device at this time, but 
caimot do so for the LANMANRedirector (due to some internal NT restrictions). So, 
accompanying software start up a system thread that politely "polls" for the existence of the 
LANMANRedirector (which will happen shortly after boot) and, once discovered, perform the 
normal layering. 
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An important part of the driver filter 66, 68 operation at boot time is to prevent any of 
the shared storage drives from mounting. In normal situations, NT attempts to mount all 
directly attached storage (and knows nothing natively about the fact that this storage may be 
already in use or mounted on another system). If NT were allowed to mount a shared device, 
and there were "writes" taking place to that drive from another system, NT would be 
confused and would attempt a "rollback" operation, wiping out any data written recently 
written (this is a recovery procedure that needs to be done in the event of an abrupt shutdown 
and is inappropriate in a shared disk environment). 

Described herein are methods and apparatus meeting the objects set forth above. 
Those skilled in the art will appreciate that the illustrated embodiment is shown and described 
merely by way of example and that other embodiments incorporating changes therein fall 
within the scope of the invention, of which we claim: 
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A digital data processing system with improved access to information stored on a 
peripheral device, comprising 

first and second nodes coupled to one another over a first communications pathway, 

the second node being coupled to a peripheral device over a second communications 
pathway, 

the first node being coupled to the peripheral device over a third communications 
pathway, 

a file system, executing on the first and second nodes, being capable of responding to 
access requests generated by the first node, for 

(i) transferring data designated by the request between the first node and the 
peripheral device via the second node and via the first and second 
communications pathways, 

(ii) maintaining administrative information pertaining to storage of the data 
designated by the request on the peripheral device, 

a first bypass, executing on at least the first node, for interceding in response to at 
least a first selected access request appHed thereby to the file system, by transferring 
data designated by that request between the first node and the peripheral device over 
the third communications pathway in accord with administrative information 
maintained by the file system pertaining to storage of that data on the peripheral 
storage device. 
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A digital data processing system according to claim 1, wherein the transfer of data by 
the first bypass between the first node and the peripheral device obviates transfer of 
such data by the file system between the first node and the peripheral device. 

A digital data processing system according to claim 1, wherein the first bypass 
transfers the designated data over the third communications pathway to physical 
storage locations on the peripheral device determined fi*om the administrative 
information for that data maintained by the second node. 

A digital data processing system according to claim 3, wherein the first bypass obtains 
from the second node administrative information fi-om which can be determined 
physical storage locations in the peripheral device for the data designated by the first 
request. 

A digital data processing system according to claim 4, wherein the first bypass at least 
initiates obtaining administrative information from the second node by generating and 
applying to the file system a second access request, the second request being for access 
to a logical unit to which access is controlled by the second node. 

A digital data processor according to claim 5, wherein the file system transfers the 
second access request to the second node over the second communications pathway. 

A digital data processor according to claim 5, wherein the second request is for access 
to a logical unit other than a file designated in the first access request. 

A digital data processing system according to claim 7, wherein the logical unit is a file 
that resides on a peripheral device local to the first node. 
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A digital data processor according to claim 8, wherein the request generated by the 
first bypass is a request to write and a file. 



10. A digital data processing system according to claim 7, comprising a second bypass, 
executing on the second node, for responding to the second access request by obtaining 
fi*om the second node, and for at least temporarily retaining, a physical mapping for a 
file designated by the first access request. 

11. A digital data processing system according to claim 10, wherein the first bypass 
obtains the physical mapping for the file designated by the first access request by 
generating and applying to the file system a third access request for access, the third 
request designating the logical unit designated by the second access request. 

12. A digital data processing system according to claim 4, wherein 

A. the first bypass generates and applies to the file system a second access request, the 
second access request being a request for writing a logical unit an identifier of a file 
designated by the first access request, the logical unit being a file to which access is 
controlled by the second node but which is other than the file designated in the first 
access request, 

B. the second bypass responds to the second access request by reading the logical unit to 
determine the identifier, obtaining fi-om the second node a physical mapping for a file 
designated by that identifier, and writing that physical mapping to the logical unit, and 

C. the first bypass generates and applies to the file system a third access request for 
access, the third request being a request for the physical mapping from the logical unit. 
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13. A digital data processing system according to claim 1, wherein the first bypass 
selectively limit transfers between the first node and the peripheral device over the 
third communications pathway. 

14. A digital data processing system according to claim 13, wherein the first bypass 
selectively limits such transfers to facilitate access to the peripheral device by a node 
other than the first node. 

15. A digital data processing system according to claim 14, wherein the bypass Umits such 
transfers in accord with a predefined throtthng limit. 

16. A digital data processing system according to claim 1, wherein the first 
communications pathway comprises any of a wide area network interconnect, local 
area network interconnect, internet interconnect, or other network interconnect. 

17 A digital data processing system according to claim 1, wherein the third 

communications pathway comprises any of a fibre channel, a firewire bus, a serial 
storage architecture (SSA) bus, high-speed Ethernet bus, high performance parallel 
interface (HPPI) bus, and other high-speed peripheral device bus. 

18, A digital data processing system with improved file access, comprising 

A. first and second nodes coupled to one another over a first communications pathway, 

B. the second node being coupled to a peripheral device over a second communications 
pathway, 
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the first node being coupled to the peripheral device over a third communications 
pathway, 

an applications program executing on the first node generating requests for access to 
files stored on the peripheral device, and for applying those requests to a file system, 

the file system, executing on the first and second nodes, being capable of responding 
to access requests generated by the first node, for 

(i) transferring data designated by file access requests between the first node and 
the peripheral device via the second node and via the first and second 
communications pathways, 

(ii) maintaining administrative information pertaining to physical mappings for 
files stored on the peripheral device, 

a first bypass, executing on at least the first node and coupled to the file system, for 
transferring between the first node and the peripheral device over the third 
commimications pathway data designated by at least selected file access requests, the 
first bypass transferring such data based on administrative information pertaining to 
physical mappings maintained by the file system, such transfers being in heu transfer 
of that data via the second node and via the first and second communications 
pathways. 

A digital data processing system according to claim 18, wherein the first bypass 
initiates obtaining administrative information pertaining to physical mappings from the 
second node by generating aad applying to the file system a second access request, the 
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second request being directed to a file for which access is controlled by the second 
node and which is not a file designated in the first access request. 

A digital data processing system according to claim 19, wherein the second access 
request includes an identity of the file designated in the first access request. 

A digital data processing system according to claim 19, comprising a second bypass, 
executing on the second node, for respondmg to the second access request by obtaining 
fi-om the second node, and for at least temporarily retaining, a physical mapping for a 
file designated by the first access request. 

A digital data processing system according to claim 21, wherein the first bypass 
obtains the physical mapping for the file designated by the first access request by 
generating and applying to the file system a third access request for access, the third 
request designating the logical unit designated by the second access request. 

A digital data processing system according to claim 22, wherein the third access 
request is directed to the same file as the second access request. 

A digital data processing system according to claim 19, wherein 

the first bypass generates and applies to the file system a second access request, the 
second access request being a request for writing a logical unit an identifier of a file 
designated by the first access request, the logical unit being a file to which access is 
controlled by the second node but which is other than the file designated in the first 
access request. 
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B. the second bypass responds to the second access request by reading the logical unit to 
determine the identifier, obtaining from the second node a physical mapping for a file 
designated by that identifier, and writing that physical mapping to the logical unit, and 

C. the first bypass generates and apphes to the file system a third access request for 
access, the third request being a request for the physical mapping from the logical unit. 

25. A digital data processing system according to claim 24, wherein at least one of the 
first bypass stores in a data structure physical mappings for the file designated by the 
first access request. 

26. A digital data processing system according to claim 24, wherein the second bypass 
flushes any write cache for the file designated by the first access request in connection 
with obtaining physical mappings for that file from the second node. 

27. A scaleable networked digital data processing system with improved access to 
information stored on a first peripheral device, comprising 

A. a plurality of networked nodes, including at least first and second server nodes, 
coupled to one another via a first communications pathway, 

B. the second server node being coupled to a first peripheral device over a second 
communications pathway, 

C. the first server node being coupled to the first peripheral device over a third 
communications pathway, 



30 



Express MaU Label No. EE776165281US 

D. each of the first and second server nodes being coupled to zero, one or more cUent 
nodes, 

E. a file system, executing on at least the first and second server nodes, being capable of 
responding to access requests generated by the first server node, for 

(i) transferring data designated by the request between the first server node and 
the first peripheral device via the second server node and via the first and 
second communications pathways, 

(ii) maintaining administrative information pertaining to storage of the data 
designated by the request on the first peripheral device, 

E. a first bypass, executing on at least the first server node, for interceding in response to 
at least a first selected access request appUed thereby to the file system, by transferring 
data designated by that request between the first server node and the first peripheral 
device over the third communications pathway in accord with administrative 
information maintained by the file system pertaining to storage of that data on the 
peripheral storage device. 

28. A scaleable networked digital data processing system according to claim 27, further 
comprising 

A. a third server node coupled to at least the second server node via a fourth 
communications pathway, 

the file system, executing on at least the second and third server nodes, being capable 
of responding to access requests generated by the third server node, for 
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(i) transferring data designated by the request between the third server node and 
the first peripheral device via the second server node and via the second and 
fourth communications pathways, 

(ii) maintaining administrative information pertaining to storage of the data 
designated by the request on the first peripheral device, 

a second bypass, executing on at least the third server node, for interceding in response 
to at least selected access requests applied thereby to the file system, by transferring 
data designated by that request between the third server node and the first peripheral 
device over the fifth communications pathway in accord with administrative 
information maintained by the file system pertaining to storage of that data on the 
peripheral storage device. 

A method for improved access to a peripheral device in a digital data processing 
system of the type having 

first and second nodes coupled to one another over a first communications 
pathway, 

the second node being coupled to a peripheral device over a second 
communications pathway, 

a file system, executing on the first and second nodes, being capable of 
responding to access requests generated by the first node, for (i) transferring 
data designated by the request between the first node and the peripheral 
device via the second node and via the first and second communications 
pathways, (ii) maintaining administrative information pertaining to storage of 
the data designated by the request on the peripheral device, 
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the method comprising: 

A. coupling the first node to the peripheral device over a third communications pathway, 
and 

B. bypassing the file system, in part, in responding to at least a first selected access 
request applied by the first node to the file system, such bypassing including 
transferring data designated by that request between the first node and the peripheral 
device over the third communications pathway in accord with administrative 
information maintained by the file system pertaining to storage of that data on the 
peripheral storage device. 

30. A method according to claim 29, wherein the bypassing step obviates transfer via the 
second node and via the first and second communications pathways of data transferred 
between the first node and the peripheral device over the third communications 
pathway. 

32. A method according to claim 29, wherein the bypassing step comprises transferring the 
designated data over the third communications pathway to physical storage locations 
on the peripheral device determined fi-om the administrative information for that data 
maintained by the second node. 

33. A method according to claim 31, wherein the bypassing step comprises at least 
initiating the obtaining of administrative information fi:om the second node by 
generating and applying to the file system a second access request, the second request 
being for access to a logical unit to which access is controlled by the second node. 
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34. A method according to claim 32, wherein the second request is for access to a logical 
unit other than a file designated in the first access request. 

33. A method according to claim 34, wherein the logical unit is a file that resides on a 
peripheral device local to the first node. 

36. A method according to claim 35, wherein the request generated by the first bypass is a 
request to write a file. 

37. A method according to claim 32, comprising responding to the second access request 
by obtaining from the second node, and at least temporarily retaining on any of the 
second node and the peripheral device, a physical mapping for a file designated by the 
first access request. 

38. A method according to claim 36, wherein the bypassing step comprises transferring, to 
the first node, the physical mapping for the file designated by the first access request 
by generating and applying to the file system a third access request for access, the 
third request designating the logical unit designated by the second access request. 

39. A method for improved access to a peripheral device in a scaleable networked digital 
data processing system of the type having 

a plurality of networked nodes, including at least first and second server 
nodes, coupled to one another via a first communications pathway, 

the second server node being coupled to a first peripheral device over a 
second communications pathway. 
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each of the first and second server nodes being coupled to zero, one or more 
client nodes, 

a file system, executing on at least the first and second server nodes, being 
capable of responding to access requests generated by the first server node, 
for (i) transferring data designated by the request between the first server 
node and the first peripheral device via the second server node and via the 
first and second communications pathways, (ii) maintaining administrative 
information pertaining to storage of the data designated by the request on the 
first peripheral device, 

the method comprising the steps of 

coupling the first server node to the first peripheral device over a third 
communications pathway, 

bypassing the file system, in part, in responding to at least a first selected access 
request applied by the first server node to the file system, such bypassing including 
transferring data designated by that request between the first server node and the 
peripheral device over the third communications pathway in accord with administrative 
information maintained by the file system pertaining to storage of that data on the 
peripheral storage device. 

A method according to claim 39, further wherein the networked digital data processing 
is of the type having 

a third server node coupled to at least the second server node via a fourth 
communications pathway, 
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a file system, executing on at least the second and third server nodes, being 
capable of responding to access requests generated by the third server node, 
for (i) transferring data designated by the request between the third server 
node and the first peripheral device via the second server node and via the 
second and fourth communications pathways, (ii) maintaining administrative 
information pertaining to storage of the data designated by the request on the 
first peripheral device, 

the method comprising 

A. couphng the third server node to the first peripheral device over a fifth 
communications pathway, 

B. bypassing the file system, in part, in responding to at least a first selected access 
request appHed by the third server node to the file system, such bypassing including 
transferring data designated by that request between the third server node and the 
peripheral device over the fifth communications pathway in accord with administrative 
information maintained by the file system pertaining to storage of that data on the 
peripheral storage device. 



36 



Abstract of the Invention 



Express Mail Label No. EE776165281US 



The invention provides a digital data processing system with improved access to 
information stored on a peripheral device. The system has a pluraUty of nodes, a peripheral 
5 device, a file system and a bypass mechanism. A first node (e.g., a chent node) is connected 
to a second node (e.g., a server node) over a first communications pathway (e.g., a network). 
The second node is itself connected to a peripheral device (e.g., a disk drive) over a second 
communications pathway. The first node, too, is connected to the peripheral device over a 
third communications pathway. The file system, executing on the first and second nodes, is 

10 capable of responding to access requests generated by the first node for transferring data 
between that node and the peripheral device, via the second node and via the first and second 
communications pathways. The file system also maintains administrative information 
pertaining to storage on the peripheral device of data designated by such requests. That 
information includes, for example, physical storage location mappings for files and other data 

15 stored on the peripheral device. The bypass mechanism, which executes on at least the first 
node, intercedes in the response to at least selected input/output, or access, requests generated 
by that node. The bypass transfers data designated by such requests between the first node 
and the peripheral device over the third communications pathway, in lieu of transferring that 
data via the second node and the first and second conmiunications pathways. Such transfers 

20 by the bypass, however, are made using the administrative information maintained by the file 
system relating to storage of such data on the peripheral device. 

Exchange.3006236.1 
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Express Mail Label No. EE776165281US 
Attorney Docket No. 0496747-0019 



DECLARATION, PETITION 
AND 

POWER OF ATTORNEY FOR PATENT APPLICATION 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an 
origmal, first and joint inventor (if plural names are listed below) of the subject matter which 
is claimed and for which a patent is sought on the mvention entitled 

METHODS AND APPARATUS FOR HIGH-SPEED 
ACCESS TO AND SHARING OF STORAGE DEVICES 
ON A NETWORKED DIGITAL DATA PROCESSING SYSTEM 



the specification of which 
(check one) 

X is attached hereto 

filed on 

as Application Serial No. 

or Express Mail No. 

and was amended on • 

(if applicable) 

I do not know and do not believe that the subject matter of this application was ever known or 
used in the United States before my invention thereof or patented or described in any printed 
publication in any country before my invention thereof or more than one year prior to the date 
of this application, and that said subject matter has not been patented or made the subject of an 
issued inventor's certificate in any country foreign to the United States on an application filed 
by me or my legal representatives or assigns more than twelve months prior to the date of this 
application; that I acknowledge my duty to disclose mformation of which I am aware which is 
material to the examination of this application, that no application for patent or inventor's 



certificate on the subject matter of this application has been filed by me or my representatives 
or assigns in any country foreign to the United States, except those identified below, and that 
I have reviewed and understand the contents of the specification, including the claims as 
amended by any amendment referred to herein. 

I acknowledge the duty to disclose information which is material to the exammation of this 
application in accordance with Title 37, Code of Federal Regulations, §L 56(a). 

PRIORITY CLAIM 

I hereby claim foreign priority benefits under Title 35, United States Code, §119 of any foreign 
application(s) for patent or inventor's certificate or of any PCT international application(s) 
designating at least one country other than the United States of America Usted below and have also 
identified below any foreign application(s) for patent or inventor's certificate or any PCT 
international application(s) designating at least one country other than the United States of 
America filed by me on the same subject matter having a filing date before that of the 
application(s) of which priority is claimed. 

Check one: 

no such applications have been filed. 

such applications have been filed as follows 



EARLIEST FOREIGN APPLICATION(S), IF ANY FILED WITHIN 12 MONTHS 
(6 MONTHS FOR DESIGN) PRIOR TO THIS U.S. APPLICATION 



Country 


Application Number 


Date of Filing 
(month, day ,year) 


Priority Claimed 
Under 35 USC 119 








_ Yes 


No_ 








_ Yes 


No_ 








_ Yes 


No_ 









_ Yes No _ 








Yes No __ 
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ALL FOREIGN APPLICATION(S), IF ANY FILED MORE THAN 12 MONTHS 
(6 MONTHS FOR DESIGN) PRIOR TO THIS U.S. APPLICATION 



CLAIM FOR BENEFIT OF EARLIER U.S./PCT APPLICATION(S) 

I hereby claim the benefit under Title 35, United States Code, §120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the clahns of this 
application is not disclosed in the prior United States application in the manner provided by the 
first paragraph of Title 35, United States Code, §112, 1 acknowledge the duty to disclose material 
information as defined in Title 37, Code of Federal Regulations, §1. 56(a) which occurred between 
the filing date of the prior application and the national or PCT international filing date of this 
application. As to subject matter of this application which is common to my earlier United States 
application, if any, described below, I do not believe that the same was ever known or used in 
the United States before my invention thereof or patented or described m any printed publication 
in any country before my mvention thereof or more than one year prior to said earlier application, 
or in public use or on sale in the United States more than one year prior to said earlier 
application, that the said common subject matter has not been patented or made the subject of an 
inventor's certificate issued before the date of said earlier application in any country foreign to 
the United States on an application, filed by me or my legal representatives or assigns more than 
twelve months prior to said application and that no application for patent or inventor's certificate 
on said subject matter has been filed by me or my representatives or assigns in any country 
foreign to the United States except those identified hereui. 



I i (Filed) i (patented.pending, 

! i i abandoned) 

i U.S. Application 09/022,266 1 12/31/97 | PENDING 



POWER OF ATTORNEY 



As a named inventor, I hereby appoint the following attorneys and/or agents to prosecute this 
application and transact all business in the Patent and Trademark Office connected therewith. 



Send Correspondence to: 

CHOATE. HALL & STEWART. Exchange Place. 53 State Street. Boston. M A 02109-2891 



Direct Telephone Calls to: (name and telephone number) 

David J. Powsner. Esq.. Tel.: (617^ 248-4030 . 

Wherefore I petition that letters patent be granted to me for the invention or discovery described and 
claimed m the attached specification and clauns, and hereby subscribe my name to said specification 
and claims and to the foregoing declaration, power of attorney, and this petition. 

I hereby declare that all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made 
with the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful 
false statements may jeopardize the validity of the application or any patent issued thereon. 



Full name of second inventor Christopher J. Stakutis 
Inventor's signatmre 



David J. Powsner 



Mary Rose Scozzafava 
Brenda H. Jarrell 



Sam Pasternack 



Mary Raynor Jimenez 



Reg. No. 31,868 
Reg. No. 29,576 
Reg. No. 36,268 
Reg. No. 39,223 
Reg. No. 37,219 



Date 



Residence of second inventor 

Citizenship U.S. 

Post Office address same 
(if different) 



Full name of second inventor 


Kevin M. Stearns 


Inventor's signature 


Date 


Residence of second inventor 


Citizenship 


U.S. 


Post Office address 


same 


(if different) 
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